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The classical theory of rank-based inference is entirely based ei- 
ther on ordinary ranks, which do not allow for considering location 
(intercept) parameters, or on signed ranks, which require an assump- 
tion of symmetry. If the median, in the absence of a symmetry as- 
sumption, is considered as a location parameter, the maximal invari- 
ance property of ordinary ranks is lost to the ranks and the signs. This 
new maximal invariant thus suggests a new class of statistics, based 
on ordinary ranks and signs. An asymptotic representation theory a 
la Hajek is developed here for such statistics, both in the nonserial 
and in the serial case. The corresponding asymptotic normality re- 
sults clearly show how the signs add a separate contribution to the 
asymptotic variance, hence, potentially, to asymptotic efficiency. As 
shown by Hallin and Werker [Bernoulli 9 (2003) 137-165], condi- 
tioning in an appropriate way on the maximal invariant potentially 
even leads to semiparametrically efficient inference. Applications to 
semiparametric inference in regression and time series models with 
median restrictions are treated in detail in an upcoming companion 
paper. 

1. Introduction. The classical theory of rank-based inference is entirely 
based either on ordinary ranks or on signed ranks. Ranks indeed are maximal 
invariant with respect to the group of continuous order-preserving transfor- 
mations, a group that generates the null hypothesis of absolutely continuous 
independent white noise (no location restriction), whereas signed ranks (i.e., 
the signs along with the ranks of absolute values) are maximal invariant 
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under the subgroup that generates the subhypothesis of symmetric (with 
respect to the origin) independent white noise. 

Now, in most statistical models a location parameter for the error term 
is usually specified to be zero: regression and analysis of variance models, 
stationary autoregressive moving average (ARM A) models and so on. Sym- 
metric white noise allows for such an identification, at the expense, however, 
of a symmetry assumption that in practice is often quite unrealistic. In ad- 
dition, the trouble with independent white noise without further restrictions 
is that it does not allow for identifying any location parameter. 

This location parameter in most applied work is the mean — a heritage 
of Gaussian models — but could be the median as well. Zero-median noise is 
certainly as natural as zero-mean noise. In a semiparametric context, it is 
even more satisfactory, because it does not require any moment assumption 
on the densities under consideration. Median regression and autoregression 
models have, therefore, recently attracted much attention: see, for instance, 
[12, 14, 15, 17, 21], to quote only a few. Moreover, from the point of view 
of statistical inference, the assumption of zero-median noise is also more 
convenient, since it induces more structure. The hypothesis of zero-mean 
white noise indeed is not invariant under any nontrivial group of transfor- 
mations, so group invariance arguments cannot be invoked in models that 
involve zero-mean noise. The situation is quite different for the hypothe- 
sis of zero-median noise, which is generated by the group of all continuous 
order-preserving transformations g such that g(0) =0. A maximal invariant 
for this group is the vector of ordinary ranks, along with the vector of signs. 
Hallin and Werker [11] have shown that, in such a situation, semiparametric 
efficiency is achieved by conditioning with respect to a maximal invariant. 
Maximality of the invariant here is essential: conditioning, for example, on 
the ranks when the signs and ranks, not the ranks alone, are maximal in- 
variant generally induces an avoidable loss of efficiency. 

Invariance and semiparametric efficiency arguments in such models thus 
lead to the new concept of sign-and-rank-based statistics, which involve both 
signs and ranks. This new concept is more natural than the traditional rank- 
based one in all models that include a location (intercept) parameter, but 
also in models such as stationary ARMA models, where the noise is inher- 
ently centered. The objective of the present paper is a detailed study of 
the class of linear sign-and-rank statistics for which we provide Hajek-type 
asymptotic representation and asymptotic normality results. These results 
readily allow for building new rank-based tests for a variety of problems 
in one-, two- and /c-sample location, regression, ARMA and related models 
without making any symmetry assumptions on the underlying error den- 
sities. They also form a basis for the construction of semiparametrically 
efficient procedures in median constrained models (see [10]). 
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The paper is organized as follows. Section 2 briefly introduces several con- 
cepts of white noise: independent, independent with zero mean, independent 
with zero median and independent symmetric white noises. We recall how 
the invariance principle for each of these concepts, but for white noise with 
zero mean, leads to a different concept of ranks and/or signs — the right con- 
cept for median-centered white noise being the signs and ranks. Sections 
3 and 4 propose a systematic investigation of (linear) nonserial and serial 
sign-and-rank statistics. These new statistics, which are measurable with 
respect to the vectors of ranks and signs, are studied along the same lines 
as the classical linear rank statistics (see, e.g., [3] for the nonserial context; 
see [5] and [7] for the serial context) and the linear signed-rank statistics 
(see [3] and [13] for the nonserial context; see [7] for the serial context). 
However, the nonindependence between the ranks and the signs (in sharp 
contrast with the traditional context of signed ranks, where the signs and 
the ranks of absolute values are mutually independent) requires a more del- 
icate treatment. Section 5 concludes with an empirical study: simulations 
very clearly show that the proposed procedures quite significantly outper- 
form their classical counterparts based on either parametric correlograms 
or traditional ranks — the more skewed the underlying densities, the more 
significant the efficiency gain. 

2. White noise and group invariance. 

2.1. White noise and semiparametric statistical models. Whatever the 
concept of ranks, rank-based inference applies in the context of semipara- 
metric models under which the distribution of some observed n-tuple Y^ n ^ := 
{Y\ \ . . . , Y^ )' belongs to a family of distributions of the form 



where 6 denotes some finite-dimensional parameter of interest and / de- 
notes some unspecified density (densities throughout are tacitly taken with 
respect to the Lebesgue measure over the real line) that plays the role of a 

(n) 

nonparametric nuisance. This distribution Pj-.g, in general, is described by 

means of (i) a residual function, namely, a family {3g 5 $ £ 0} of invertible 
functions indexed by n and 6 that map the observation Y 1 -™) onto an n-tuple 
of residuals 



{/ : f{x) > 0,x G M} as the set of all nonvanishing densities over the real 




{P^fleeci^/ef}, 
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line, let J-* := {f E J- : /f^ zf(z) dz = 0} be the subset of all densities in T 

with zero mean, let Tq : = {/ G T: J- OQ f(z)dz = f(z)dz = 1/2} be the 
set of densities in T having zero median and let JF + := {/ E T:f{—z) = 
f(z),zE M} be the set of densities in T that are symmetric with respect to 
the origin. Denote the following terms: 

(a) Independent white noise: Let Ti^ denote the hypothesis under which 
the random vector Z( n ) = (z[ n \ . . . , Z n n ')' is a realization of length re 

of an independent white noise; that is, Z^, i = 1, . . . ,n, are i.i.d. with 
density / G T . 

(b) Zero-mean independent white noise: Let Ti^J denote the hypothesis 
under which Z^ n ) is a realization of length n of an independent with 

(n) 

zero-mean white noise; that is, Zf \ i = l,... , re, are i.i.d. with density 

(c) Zero-median independent white noise: Let Ti^yJ denote the hypothesis 
under which is a realization of length n of an independent with 
zero-median white noise; that is, z\ n \ i = 1, . . . , re, are i.i.d. with density 

(d) Symmetric independent white noise: Let Ti^j denote the hypothesis 
under which Z( n ) is a realization of length n of an independent symmetric 

(n) 

white noise; that is, Z\ , i = 1, . . . ,n, are i.i.d. with density / G T+. 

The notation TL^ n \ H.[ n \ TLq 1 ^ and Ti^ is used whenever the underlying 
density function / remains unspecified within T , JF*, !Fq or respectively. 

(n) 

In practice, of course, the role of the random variables Z\ is played by 

the residuals z\ n \9) (i = 1, . . . , re) associated with a specific value of the 
parameter in the statistical model under consideration. 

The independent white noise hypothesis H.^ is most general, but does not 
allow for identifying location parameters. A classical attitude, when location 
is to be identified, consists in assuming that the underlying white noise 

(n) 

density has zero mean, that is, adopting Til ■ As already explained, an often- 
used alternative solution requires the median (instead of the mean) of the 

(n) 

white noise density to be zero, leading to TCq . The additional assumption 

In) 

of symmetry yields 7i + . 

2.2. Group invariance: ranks, signed ranks, and signs and ranks. Let 
£ (n) ._ (tngn^W := {pg, e 0, / G F}) be characterized (in the sense 

of Section 2.1) by the residual function 3g and the white noise concept 
Denote by & the set of all continuous, strictly monotone increasing func- 
tions g : M — > K such that lima,— >±oo9( x ) = ioo, define Qg : z = (z\, . . . , z n )' G 
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W 1 1 ► Qg n \z) := (g(zi), . . . ,g(z n ))' G W 1 and consider the group (acting on 

R n ) 

Q ( e\°--={{lt ) T l °Gt ) ^e\9£®},°. 

This group (known as the group of order-preserving transformations of 
residuals) clearly is a generating group for the fixed-0 submodel £^ n \6) := 
(R n ,B n ,V( n) (0) := {P { e n) f , f G J 7 }) of£( n \ with maximal invariant the vector 

RW(fl) := (^(fl),...,^^)/, where i?S n) (0) denotes the rank of the 
residual z\ n) {0) among z[ n) (0), Z^\d). 

Similarly, let + := {g G <55 2) = — g(z)} and denote by the cor- 
responding subgroup of G { q \ This group (the group of symmetric order- 
preserving transformations of residuals) is a generating group for S { l l \e) : = 

(R n ,B n ,V ( +\o) := {Pg™|, / G the submodel of £^(0) that results 

from restricting to symmetric densities / G J-+. A maximal invariant here 

is the vector ] (6) := (sj* (0)^(0), . . . , sl n) (0)^(0))', where #^(0) 
denotes the rank of the absolute value | zf 1 ' (0) | among | z[ n ' (0) | , . . . , | Z^ (0) | 
and where s ^ (0) is the sign of z\ n \e). 

Turning to the model £^ n) := (R n ,B n ,V^ n) := {P^ f , G 0, / G JF }) char- 
acterized by the residual function 3g n ^ and the zero-median white noise 
concept TtlyJ, it is easy to see that a generating group for (with obvious 

notation) £ q 1 ^ (0) is obtained by considering the subgroup of Q { e ] that cor- 
responds to 0o := {g G <3:g(0) = 0}, with maximal invariant the vectors 
s (")(0) := (s[ n) (0),...,si n) (0))' of residual signs and R( n )(0) of residual 
ranks. 

Provided that the parameter contains a location or intercept compo- 
nent, and leaving aside the condition that residuals should have finite first- 
order moments, the model «?i n) := (R n ,B n ,V*^ := {Pj$,0 € 0, / G JF*}), 
which is characterized by the same residual function 3g as £q , but has 

zero-mean rather than zero- median white noise, coincides with £q . Both 
models indeed involve the same family of distributions over (R n ,B n ); 
they only differ in the way the nonpar ametric family is split into a 
collection of parametric subfamilies := {P^g,0 G 0} (hence, of course, 

in the way is to be interpreted). Rather than two distinct models, Eq^ 

(n) 

and £i thus constitute two different parametrization of the same model, 
but the invariance structure underlying £q^ is not present in E^ . The me- 
dian, in this respect, allows for a richer structure and, therefore, seems more 
appropriate than the mean as a location parameter. 
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2.3. Group invariance and semiparametric efficiency. The importance 
of considering maximal invariants — thus, signs and ranks in models with 
zero-median white noise — has been substantiated by Hallin and Werker [11]. 
Their paper showed that, in a very broad class of models, semiparametrically 
efficient inference procedures can be obtained by conditioning with respect 
to a maximal invariant cr-algebra. 

More precisely, assume that the semiparametric family (2.1) is such that: 

(i) For any fixed /, the parametric subfamily := {P^g,6 £ 0} is lo- 
cally asymptotically normal (LAN), with central sequence (0). 

(ii) For any fixed 0, the nonparametric subfamily := {P^,/ € J 7 } 
is generated by a group of transformations with maximal invariant 
WW(fl). 

Then, under very general conditions, semiparametrically efficient inference 
(testing, estimation, etc.) at / can be based on the semiparametrically effi- 
cient central sequence E[Aj. n) (0)|W( n )(0)], which, moreover, is distribution- 
free under V^. Projecting onto maximal invariant a-algebras (generated, 
in the context of Section 2.2, by the ranks, the signed ranks or the signs and 
ranks) thus yields (at given /) the same results as tangent space projections. 
In a companion paper [10], we specialize the Hallin and Werker [11] abstract 
results to obtain semiparametrically efficient inference in median regression 
and autoregressive models using the asymptotic representation results of the 
present paper for general sign-and-rank statistics. 

Inference based on ranks and signed ranks has since long ago made its way 
to everyday practice and even to elementary textbooks. A pretty complete 
toolkit of rank-based methods is available for the analysis of linear models 
with independent observations (see [4, 18] for a systematic account and the 
state of the art in this context), as well as for the analysis of linear time series 
models (see [2, 5, 6, 7, 9]). It is somewhat surprising, therefore, that sign- 
and-rank statistics never have been considered so far in the vast literature 
devoted to that subject. The purpose of this paper is to fill this gap. 

2.4. Two simple examples. Two examples are treated in some detail in 
Sections 3.4 (median regression) and 4.4 (median moving average), respec- 
tively. 

Under the median-regression model, observations are of the form 

(2.2) Yi = 0i + 6 2 cf ] + Si, i = l,...,n, 

where 6 := (61,62) £ IR 2 , the c\ s are regression constants and the Ej's are 
independent and identically distributed (i.i.d.) with density /. Instead of 
the usual specification that E[ej] = 0, however, we rather impose that the 
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median of £j is zero (i.e., / £ Fo). Here, the residuals take the form Z^ n \0) := 
Yj — — 02c\ n \ Under P^g, these residuals are i.i.d. with density / G .Fo- 
under fairly general conditions, this model, for fixed / (with weak derivative 
/'), is LAN with central sequence 

n _ ft , 1 

(2.3) - n-^E-fC^W) ( C W 

i=l ^ * 

In the first-order median moving average (MA) model, observations are 
generated by the MA equation 

(2.4) Y t = £ t + 9e t -i, t = l,...,n, 

with 6 G (—1,1). Here again, we assume that the e^'s are independent and 
identically distributed with density / and median zero. For simplicity, as- 
sume £o = 0. The residuals are defined recursively as z[ n ^ (6) :=Y t — 0Z^\ (6) , 

with initial value Zq(G) = 0. Here again, for fixed / (with weak derivative 
/'), LAN holds with central sequence 

(2.5) Af\0) :=n-V3£^(zJ">(fl))z£>(*). 



t=i 



f 



2.5. Sign-and-rank statistics. A sign-and-rank statistic is an (s^,R^)- 
measurable statistic, where s( n ) = (s^\ . . . , s^)' and RW = (R\ , . . . , R^)' 
are the vector of signs and the vector of ranks, respectively, associated with 
some n-dimensional random vector Z^ n \ The objective of this paper is to 
introduce linear nonserial (Section 3) and linear serial (Section 4) sign-and- 

(n) 

rank statistics, and to study their distributions under TCq . 
Denote by 

n n 

:=£l[z{ B Uo] = X><"> = -l] 

i=l i=l 

and by 

n n 

N^:=J2l[Z^>0]=J2l[s^ = l] 

i=l i=l 

the numbers of negative and positive components in Z^ n ) (in s^ n ^), respec- 
tively. Under , iV~j n) is binomial Bin(n, 1/2). Letting N< n > := (ivi n) , ) , 
note that a(N^) = a(ivi n) ) = o-(iV"j n) ), because iV"j n) = n - with prob- 
ability 1. Since 4 n) = I[z\ n) > Q]-I[z\ n) < 0] = I{R^ > n-N^]-I[R^ < 
N^] for all i = l,...,n, the couple (N( n ),R/ n )) is maximal invariant for 
n . 
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Defining the sets 

AA W :={iG{l,...,n} :Si (n) = -l} = K<...<r (n) } 

and 

< n) :={,G{l,...,n}: Si W = l} = {f+<...<^ ) }, 

the distribution of (s( n ),R,( n )) under Hn is conveniently characterized as 
follows: The marginal distribution of s^ n ^ is uniform over the 2 n elements 
of {—1, l} n and the conditional distribution of R( n ) given s( n ) is such that 
{R [n) , R {n J R {n J ; , R [r l ] ) is (conditionally) uniformly dis- 

tributed over the (iV_ !)(JV+ !) possible combinations of a permutation of 

{1, . . . ,N_} with a permutation of {(n — N+) + 1, .. . ,n}. 

(jv£ n) ) (Ar (n) ) 

Let us finally denote by Z, ^_ and Z, the vectors of order statis- 
tics associated with the negative and positive elements of ZW, respectively. 
These two vectors — the first one of length N_ and the second one of length 
— constitute a natural (random) decomposition of the vector of order 
statistics Zq associated with Z( n ). 

3. Nonserial linear sign-and-rank statistics. 

3.1. Definition and conditional asymptotic representation. A linear non- 
serial sign-and-rank statistic is a statistic of the form 

i n 

(3.1) :=-^ C! (n) aW(NW;R t (n) ), 

where a^ n \-; •) is a real-valued score function defined over {((f, Tj);i) :u,rj E 
{0, 1, . . . , n}, rj <n — u, i S {1, . . . , n}}; note that each summand in (3.1) is 
allowed to depend on the sign s[ n ^ of z\ n ^ , but also, via N^") , on the other 



signs, but not on the other ranks. As usual, the c- n ^'s (i = 1, . . . , n) denote 
nonrandom regression constants. 

The exact mean Efsi"^] and the exact variance Var[£c ] of Sc under 

In) 

TCq are easily obtained from elementary combinatorial arguments: Letting 
g(n) . = n -i t>) , we obtain 

i=li/=0 ^ ' 



SIGN-AND-RANK STATISTICS 



9 



and 




respectively. 

If asymptotic results are to be obtained, some stability of the scores 
is required as n increases. We therefore assume the existence of a score- 
generating function. A function ip : (0, 1) — ► R is called a score- generating 
function for the score function a^> if 

(3.2) E[{ a ( n \N^;R^ ) )-ip(F(z[ n) ))} 2 \Z < ^} = o P (l) 

under H-Q.j, as n — > oo. Here F denotes the distribution function associated 
with density /. Note that, by the rule of iterated expectations and the fact 
that N( n ) = (N_ ,N+) is measurable with respect to a sufficient 

condition for (3.2) to hold is 

(3.3) E[{a( n )(N( n );4 n) )" v(^(^{ n) ))} 2 |N (n) ] =o P (l) 
under Hq!^, as n — > oo. 

(n) 

No asymptotic results for 5c can be obtained without some assump- 
tions on the asymptotic behavior of regression constants c| , i = 1, . . . ,n. 
We assume that the classical Noether condition holds: 

(N) The constants c\ n \ i = 1, . . . ,n, are not all equal and 

m)1Y ,... (Jp) _ dn)\2 
maxi<j< n ^Cj — c v ) 

hm ~ ~ , V — = 0. 

We may now state a first asymptotic representation and asymptotic nor- 
mality result. This result, however, is a conditional one in the sense that 
the centering in (3.4) and (3.5) below is a conditional centering. Since, con- 
ditionally on the signs, the sign-and-rank statistic (3.1) reduces to a purely 
rank-based statistic, this conditional representation result follows from clas- 
sical results on linear rank statistics and merely serves as an intermediate 
step in the derivation of the main result (of an unconditional nature) in Sec- 
tion 3.3. Contrary to the unconditional one, which requires exact or approx- 
imate scores, the conditional result holds for any scores that satisfy (3.2). 
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Lemma 3.1. Let (p: (0, 1) — > M be a nonconstant square-integrable score- 
generating function for and let the regression constants c(^ (i = l,...,n) 

satisfy the Noether condition (N). Assume moreover that YH=i{ c t^ — c^) 2 = 
0(n), asn— >oo. Then: 

(i) (Asymptotic representation) under Ti^J, asn— >oo, 

(3.4) SW - E[^»)|NW] = T$ - Ep$|Z$] + op(l/Vn), 

where T^j := i ^i=i c i™" VO^X-^i )) (^ stands for the distribution function 
associated with f ); 

(Asymptotic normality) under li!^ 1 , asn^oo, 



(3.5) v^(4 n) -E[5^ n) |N (n: 
where < cr 2 := Jq tp 2 {u) du — (Jq 1 cfa/) 2 < oo 



i^(c^-cW) 2 ^AA(0,^), 
\ n i=i 



Observe that, under TLq , 



1 n 

E [5W| N W] = -^ c 5 n )E[E[aW(N( n );i?J n) )| S W]|NW 



n . 



i E^ ) p [4 n) = -l|N^]4o E aW(NW;j) 



1 n ~1 

+ PK W = 1 | N (n) ] ^_ J- a W(NW;j) 
^+ j=( n _JvW)+l J 



/ n 

cW i^ a W(NM;i?S n) ) 



n . 



and 



(3.6) 



E[rS|z^] = i^^E[^(F(zW))|zW] 



j=l 
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Hence, part (i) of Lemma 3.1 actually states that 
i n 

i£(cf)-cW)aW(NW;i?S n) ) 

(3.7) 

1 = 1 

(n) 

under Wq.j, as n — > oo. Note that the expression on the right-hand side 
of (3.7) coincides with the asymptotic representation of the purely rank- 
based statistic ±E?=i(4 n) -c^)aft\R\ n) ), where a^C^) are, for in- 
stance, the traditional exact scores E[(p(F(Z^ n) ))\R\ n) ] associated with the 

(n) 

score-generating function ip. The sign-and-rank statistic Sc thus asymptot- 
ically decomposes into two parts; one of them (namely, si n) - E[^ n) |N( n )]) 
asymptotically does not depend on N^ n ^ and represents the contribution 
of the ranks, while the second one (E[si n) |N( n )] - E[si n) ]) constitutes the 
contribution of the signs. Moreover, the ranks and N( n ) being mutually in- 
dependent, these two quantities are orthogonal to each other and contribute 
additively to the unconditional asymptotic variance (see the proof of Propo- 
sition 3.2 below). 

Proof of Lemma 3.1. Since the ranks R( n ) and N( n ) are mutually 
independent under TL^J, part (i) of the lemma follows from classical asymp- 
totic representation results for linear rank statistics; see [3], page 61. The 
proof of part (ii) of the lemma, in view of (3.4), simply consists in checking 

that v^(T^-E[T^ 

n }\^)]) satisfies the traditional Lindeberg condition. 

□ 

3.2. Exact and approximate scores. Following the classical literature on 
ranks, we consider in the present paper sign-and-rank statistics based on 
either exact or approximate scores. 

Let u[ n \...,U n n) be an ?i-tuple of i.i.d. random variables uniformly dis- 
tributed over (0,1). Define s[J := I[U$ n) > 1/2] - I[u\ n) < 1/2], N$_ := 
Y,Ul[ul n) < 1/2] and :=E? =1 /[^ (n) > 1/2]. Denote by B$ the 

rank of among u[ n) , . . . , U n n) , by U$_ (i = 1, . . . , v) the ith-order 

statistic associated with a sample of v i.i.d. random variables uniformly 
distributed over (0,1/2) and by uj^) + (i = l,...,u) the ith-order statistic 
associated with a sample of v i.i.d. random variables uniformly distributed 
over (1/2,1). Note that the conditional distribution of given the event 
sty = —1 (resp. Sy) = 1) is uniform over (0, 1/2) [resp. (1/2, 1)]. The linear 
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nonserial sign-and-rank statistics constructed from the exact and approxi- 
mate scores associated with <p are defined by 

S {n) , :=-yc^a {n) , (NW;#) 

c;ip;ex/appr ^ i. < i i^ex/appr \ ' i ' 

i=l 

1 n 



(3-8) :=^E^U[4 nj =-l]<i ;ex/appr (^^r 

i=l 

+ ^ = l]^^(N^-M n) - (n - 

where the score functions a£?_ ;ex , a^-- apP r> a ^"+;cx an d a ^+;appn an defined 
on the set {(u;i); v,i G {1, . . . , n} with i <u}, are given by 

(3.9) a£L ;e >;z) := E[<p{U™ )\N§_= u, < = i] = E[<p(U$_)}, 
(3-10) ^ ;appr (^):^(E[[/M])^(^), 

i) := E[<p(U^)\N^l = u, < } = (n - u) + i] 

(3.11) 
and 

(3-12) "SUpr^O == ^[^D = + ^^y) 

Observe that, under H§}, si% x = E[r^|N( n ),R( n )] = E[T^| S ( n ),R( n )]. 
We then have the following proposition. 

Proposition 3.1. Let (p : (0, 1) — ► R &e a nonconstant square-integrable 
function. Then ip is a score- generating function for affl cx - If, moreover, ip 
is the difference of two nondecreasing square-integrable functions, then ip is 
also a score- generating function for a^ip pr . 

Proof. Let us first consider the exact scores defined by relationships (3.8), 
(3.9) and (3.11), and let us show that, under H^j, 

(3.13) E[{ag x (NM;4 n) )-v,(F(Z; n) ))} 2 |NW]= p(l) 



(n) 

as n — ► oo . By the definition of a^._. ex , we only need to show that 

E[{E^(F(z( n) ))|4 n) = -l,ivi n) ,4 n) ] -v9(F(z( n) ))} 2 |Ar(™), s W = _i] 
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under Tl^J, as n-> oo. Since F(z[ n ^) is, under 1~L^ and conditionally on 

= —1, uniform over the interval (0, 1/2), this readily follows from a slight 
generalization of Theorem V.1.4.a in [3], page 157. 

Let us now consider the approximate scores defined by (3.8), (3.10) and (3.12). 
Clearly, (3.3) holds for a^i ppr if, under H^J, 

E[{a^;a Ppr (iV W ;4 n) ) " ^{F{Z^))Y\N^\s^ = -1] 

and 

E[{aS ;appr (ivf ;4 n) - (n - N™)) - ^(F(Z^))} 2 \N^\s^ = 1] 
are op(l) as n —> oo. We have 

m^- m {N^-M n) ) - <p(F(z^m 2 \N^\ s ^ = -i] 

= E[{(aW.^W ; 4»))_ W^W ;JJ (») )) 

+ (4;-;ex(^;4 n) ) " y(F(4 n) )))} 2 |^\4 n) = -1] 
< 2E[{a^ ;appr (iV^;^) - a^ ;ex (^; i^)} 2 ^, s[ n) = -1] 

+ 2ffi[{a£L.«(^ 

In view of the result for exact scores, we just consider the second term. 
Denoting by [x\ the integer part of x (x £ R + ), we may write 

n{o®- m ^- h >4 n) ) - ^U(^ n) ;4 n) )} a |Jvi B) ,4 n) = -i] 







{ Q <£;-;a PP r(-^— i *) — °V;-;ex(-^- >*)} 

1=1 

'{(afe-.-appr^; 1 + L^uj ) " V(«/2)) 

+ MV2) - a^ ;cx (iV W ; 1 + [N^u\))} 2 du 
< 2 £{a^ ;appr (ivi n) ; 1 + Livi n) nJ ) - ^(u/2)} 2 

+ 2 j\a^(N^; 1 + L^uj) - ^(V2)} 2 dn. 

That this latter quantity is op(l) follows from an obvious adaptation of 
Lemma V.1.6.a and Theorem V.1.4.b in [3], pages 164 and 158, respectively. 

□ 
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3.3. Asymptotic representation and asymptotic normality. We now can 
state, for the nonserial case, the main result of this paper. 

Proposition 3.2. Let ip : (0, 1) — ► R he a nonconstant square-integrable 
score-generating function for "S^-ex/appr an d ^ the regression constants 

c| (i = \,...,n) satisfy the Noether condition (N). Whenever approxi- 
mate scores are considered, assume that <p is the difference of two non- 
decreasing square-integrable functions. Assume, moreover, that cW =0{1) 
and Y%=i( c i ~~ c^) 2 = 0(n) as n — > oo. Let \i~ := f^ 2 tp(u) du, /i+ := 
fh 2 <p(u) du and fj, v := ^Q<p(u)du. Then, writing for either 5c-$;ex or 



/2 

in. 

c;i/?;appr • 



s (n) 



(i) (Asymptotic representation) under TL^J, asn^-oo, 



5 W _ E[5 (n)] = I £ (c f) _ cW) ¥? ( J P(Z^)) 
71 i=l 

+ [2^^ + 2^-/4 - AV j + op(Vv^ 



(3.14) i=1 



(ii) (Asymptotic normality) under TCq 1 ^, asn— >oo, 



(3.15) ^(S<"'-E[ S <»)])/ 



/T 2 n 

-fE(4 n) - c - (n) ) 2 + [ c -w(^-^)] 2 



^AA(0,1). 



Note that, in case ip is skew-symmetric with respect to 1/2 [i.e., <p(u) = 
— ip{l — u)], we have /i~ = — /x+ and /U^ = 0. Straightforward calculation 

yields gM{2-^ + 2-j^p+ - = cC n V"(l - 2-^-). The conditional 
(3.4) and unconditional (3.14) asymptotic representations thus coincide and 
reduce to Hajek's traditional one for linear rank statistics, as soon as c^ = 
o(l) (examples of skew-symmetric score functions are the location scores 
ipj := —f'lf of a symmetric distribution with density /). 

Proof of Proposition 3.2. (i) We first establish (3.14) for exact 
scores. From (3.4) and (3.6), we have 

1 n 

SfeL - E[SWJ = \ £(c^ - c (n) MF(zt >)) 

(3.16) 



n • 



+ E$») |N<»>] - E[^jj + op(1/v^). 
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Since 

E[Sg. ex |NW] =E[E[rg|NW,R( n) ]|NW] =E[r$|N< n >] 
= E[E[T^|sW]|N( n )] 



= E 

where 



lfcS")E[^(F(^))|^]|N( 



n) 



n f . 



E[^(F(zf)))|^] = J[ a W = -1] f'\{u)2du 

Jo 

+ /[<> ) = 1] ^(n)2dn 

J 1/2 

= 2/[ fl ^ = -l]/i-+2/[ a W = l] M ; 



it follows that 



r) n 
TL . 1 



(3.17) +J[ S W = 1] M +|NW] 



and 



n ^ n r 



(3.18) E[5W«|NW] - E[5W «] = c» 2—^" + 2 



#(») ^(») 



-AC - fJ"p 
n r n r 

which, along with (3.16), establish (3.14) for exact scores. 

Turning to approximate scores, we can assume, without loss of general- 
ity, that cp is nondecr easing. Since (3.16) also holds if approximate scores 
are substituted for the exact ones, it is sufficient, so that (3.14) holds for 
approximate scores, to show that the difference 

:= {E[Sg. appr |NW] - E[Sg. appr ]} 

(3.19) 

-{E[5g. ex |NW]-E[5g. ex ]} 

is op(l/y/n). Note that 
E\S [n) |N (n) l 

(3.20) =^ E»f-^-) + E^ + 



i r V 2(ivi n) + 1) J p[ V2 2 (iv| n) + 1) 
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,s( N (n) iV| n) , I 

= ^n)) 2 ^ D - +2 _j_ D + I 

1 n N { ™> n N { ™> J 

where D~ := ± YJJli ^(^d+Tj) and D ™ ■= 2k T,?=i ¥>(5 + 2(^17) are Rie- 
mann sums for the integrals fi~ := J^ 2 <p(u) du and := fh 2 (p(u) du, re- 
spectively. Since ip is square-integrable, any term in the Riemann sum 
2mS^=iV 2 (5 + 2(m+i) ) assoc i a ted with J^ 2 ip 2 (u) du is o(l) as m — ► 00. 
This implies that 2^y(^ + 2(m+i) ) * s °(V\A™)> hence, in view of the fact 

that 7V| nJ = P (n), this implies that + 2(jV (n )+1) ) = op(l/y/n) as 

n — > 00 . The same reasoning shows that any finite sum of Riemann terms in 
^~{ n ) or Djj(n) actually is o-p{l/yfn) as n — ► 00. 

Now, any Riemann sum for fi^ satisfies, since ip is nondecreasing, the 
double inequality < D+ < D+, where D + := g^E^Ms + 2(^+1)) 
and 5+ := ^ Ejli ^(5 + 2(m+l) ) are ^ e u PP er an d lower Darboux sums 
associated with Jy 2 p(u) du. The difference I)+ — I2 m clearly is 2^(^(5 + 
2(m+i) ) — ^C^))) which is o(l/y / m) as m — > 00. Hence, for any Riemann 
sum, B+ - „+ is also o( WH), so that Dj^ - $ = „r(l /v /H) as „ oo. 

Furthermore, since the sequence -D+ — /i+ converges to zero, it is bounded, 

N < - n ' 1 

so that D + {n) — fit is uniformly integrable and E[— ±— D + (n) — ^mJ] = °0-/V^) 
as n — > cxjy+ ^+ 

A similar reasoning of course holds for D~ ^ and //~. Going back to (3.20) 

and recalling that c^ n ^ = 0(1), we thus obtain the desired result that 
is op(l/y/n). This completes the proof of part (i) of the proposition, 
(ii) As for asymptotic normality, elementary calculations yield 



nc (n) [ 2—fi- + 2^/4 - ii v 

n T n t -r 



N (n) N (n) 



4' 



which, since ^)/-\/l/4n is asymptotically standard normal, is also 

asymptotically normal with mean zero and asymptotic variance [c^ n \u~ — 
ftp)] 2 - The remark (right after Lemma 3.1) on the orthogonality between 

the two parts of the asymptotic representation of completes the proof. 
□ 



(»> 
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Test statistics related to "regression coefficients" naturally involve "re- 
gression constants" c- n ^ that are not all equal. Quite on the contrary, 
test statistics related to location and intercepts do not involve any 
constants — more precisely, they are still of the form Sc , but with con- 

(n) 

stants c\ all equal to 1. Proposition 3.2, as it is stated, does not apply. 

(n) 

However, going back to the proof, one easily checks that, letting S^.^i T := 
^ Ya=i ^cx/annr (-^"^ ' ^1"^)' un der the same assumptions on the scores tp, 



/CYc( n ) —E\S^ ]) 

» v 95;cx/appr I- <p;ex/apprJ / 
N (n) N (n) 

(3.21) =2-=-/,- + 2^-/4-/^ + ^(1) 



■AT(0,(/x--/i+) 2 ) 



under t6q \ as n — > oo. 



3.4. Example: median regression. The central sequence (2.3) takes the 
form A^ n) = Aj. n) (0) = v 7 "^]/;!,^"?/^)' with ( usin S the notation of Sec- 
tion 3) 



^/ ; i^£^/(^ n) )), 

1 n 



i=i 



and ff(u) := —f~{F 1 { u )),u£ (0, 1). Instead of an arbitrary score-generating 
function, we therefore focus on iff. Define 

<!i;ox:=E<! J;1 |NW,RW] 

and 

<kex^=E[r^ /;2 |NW,RW]. 
Straightforward calculations lead to 

N (n) N (n) N (n) _ ^(n) 

2 —^f + 2 ^r< - ^ = 2 /(°) n ~ . 

so that Proposition 3.2 and (3.21) yield 



E[A^|NW,RW]=V^( S ;1;cx ] =A^* + 0P (1) 
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under P^, as n — > oo, where 



(n)* 



2/(0) 



n 



^ (c W _ c^iFiZ^)) + c»2/(0) 



/ 



is a version of the semiparametrically efficient central sequence for 6 in the 
semiparametric experiment Sq . This latter statement can easily be checked 
using standard tangent space calculations. Similarly, in view of (3.5) and 
Proposition 3.1, the approximate score version of the same semiparametri- 
cally efficient central sequence is 



A 



(n)* 



2/(0) 



N (n) _ N (n) 



n 



\ 



x i=i 
with, for £ = 1, ... ,n, 



1 JO. 



n 



(3.22) 



s \2 2 (iv| n) +l 



R 



(n) 



2(ivi n) + 1 



1 K 



(n) 



(n-ivj n) ) 



This central sequence, which is measurable with respect to the residual signs 
and ranks, can be used to perform semiparametrically efficient inference 
(tests, estimation, etc.); see, for example, Section 11.9 of [16]. For a full treat- 
ment of sign-and-rank-based versions of semiparametrically efficient central 
sequences in median restricted models, we refer to [10]. 

4. Serial linear sign-and-rank statistics. 



4.1. Definition and conditional asymptotic representation. Nonserial sign- 
and-rank statistics, just as their traditional rank-based counterparts, are in- 
efficient in the context of dependent observations: Only serial statistics can 
capture the effects of serial dependence. Define a linear serial sign-and-rank 
statistic of order k (k G {1, . . . , n — 1}) as a statistic of the form 



S, 



in) ._ 



1 



n — k 



(«) \ 
t-kh 



t=k+l 
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where ajj. (•;•,..., •) is defined over the product of the set {(u, r]);v,r] £ 
{0, 1, .. . ,n},rj < n — v} with the set of all (k + l)-tuples of distinct inte- 
gers in { 1 , . . . , n} . The asymptotic mean and variance of are given in 
the subsequent Proposition 4.1. 

Here also an asymptotic representation result is proved, establishing the 
asymptotic equivalence between and a "parametric" serial statistic . 

The asymptotic normality of then entails that of . A function 
ifk '■ (0, — > R is a score-generating function for the serial score function 

4 n) if 

E[{af (N^;Rtiv ■ ■ • >4 n) ) " MF(Z^), . . . , F(Z{")))} 2 |Z^] 

(4.1) 

= o P (l) 

under Ti.^J, as n — > oo. Once more, (4.1) automatically holds if, under fi^l, 

E[{4 n) (N w ; Rttv - , 4 n) ) - Wffi), • • • ,F(^ n) ))} 2 |NW] 

(4.2) 

= o P (l) 

as n — > oo. We then have the following conditional asymptotic representa- 
tion and asymptotic normality results, which are the serial counterpart of 
Lemma 3.1. 

Lemma 4.1. Let (p k : (0, l) fc+1 -> R 6e a score-generating function for 
a k . Inen: 

(i) (Asymptotic representation) under Ti^j, as n — > oo, 
(4.3) of > - E[sf >|NW] = r£> /;fc - E[T^. f;k \Z^} + o P (l 

i n 



n fe t=fc+l 



and 



= [n(n-l)---(n-k)}- 1 wW) 

l<*i^-#*+i<n 

(ii) (Asymptotic normality) i/, moreover, < ^k+i |^fc(ufc+i, . . . , 
iii)| 2+<5 <iui • • • duk+i < oo for some 5 > 0, i/ien, under t6q \ as n ^ oo, 

v / ^fc(5l n) -E[5l n) |NW]) ^AA(0,V 2 ), 
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where, denoting by XJ\^XJ^... an i.i.d. sequence of standard uniformly dis- 
tributed random variables, 



+ 2 £ E[<p* k (U k+1 , l7i)^(C7fc + i +ij . . . , U 1+j )] 



with, for m,.. -,u k+1 G (0, 1), 

:=V9 fc (n fc+ i,...,ni) 
fc+i 



- J2Mvk(Uk+u---,u 1 )\u l = u 1 }+m[<p k (u k+1 ,...,u 1 )}. 



Proof. To prove part (i) of the lemma, we only need to show that, 



under as n -» oo, E[{Z)jf ; } 2 |Z 7] = o P (l), where 



Since the maximal invar iant (N( n ),R( n )) depends on Z/x only through 



NW, we actually have E[{D^} 2 |Z^] = (n - jfe) Var[5^ n) - T^. f . k \Z^]. 
Conditionally on zi n ? (and hence on N^" - )), S k — T^K. k is a linear serial 



rank statistic in the sense of Hallin, Ingenbleek and Puri [5] . Corollary 2 of 
Lemma 2, and Lemma 4 (Appendix 3) of that paper imply that there exists 
a constant K (not depending on n) such that 



By (4.1), the last term converges to zero in probability under as n — > oo, 
which completes the proof of (4.3). 

The asymptotic normality of y/n^k(T^.j. k - E[T^ /;fe |Z^]) [part (ii) 

of Lemma 4.1], hence also that of Vn^k(S { k n) - E[5^|N^]), is also es- 
tablished in [5]. The special form of V 2 follows from Yoshihara's [20] central 
limit theorem for ^-statistics under absolutely regular processes, which re- 
quires the (2 + <5)-integrability of the score-generating function ip k . □ 



E[{^(C/ fc+1 ,...,C/ 1 )} 2 ] 



(4.4) 



k 



1=1 




xEKaftNW;^,...,^) 

- Vk (F(Z^ 1 ),...,F(Z^))} 2 \Z^]. 



SIGN-AND-RANK STATISTICS 21 



Note that the right-hand side in (4.3) is exactly the same as in the asymp- 
totic representation of the purely rank-based serial statistic 

Jki v *u+i' '-+1; 



E 



Vn + 1' 'n + iy 



t=fe+i 



This remark, which is analogous to the remark made in the nonserial case 
just before the proof of Lemma 3.1, will play a crucial role in the proof of 
the asymptotic normality part of Proposition 4.1(h). 

4.2. Exact and approximate scores. As in the nonserial case, two types 
of scores — the exact and the approximate ones — are naturally associated 
with a given score-generating function. Define (referring to Section 3.2 for 
notation) 

1 n 

</> fe ;ex/appr - — „ _ u U <p k ;ex/appr V 1 ^ ) *h > -"t-l ) • • • ? ^t-kJ' 

11 ft t=k+l 

where, for (77, v) £ {0, 1, . . . , n} 2 , v <n — r\ and 1 < zi 7^ 22 7^ • • • 7^ h+i < n ; 
air fc ) ;ex((^> I/ );n>--->^+i) 

:= E[<p k (U<f\ ■ ■ • , = 17, <l = ^ 

K Vi - i 1} . . . , K Uk+i -i k +i\ 



and 

'*V fc ;appr 

„. ^i?rrr( n )i f\r( n ) _ /\r( n ) _ 

. . . , EfO^L = V , N™. = „, <) i = i k+1 }) 



fe+i 

\ \ 2(77 + 1)/ L J V2 2(i/ + l 

...,I[ik+i <v]( 



,2(77+1] 

+ I^fc+i > n - v\ - + 



,2 2(i/ + 1) 

The following lemma provides sufficient conditions for to be a score- 
generating function for a^ ;ex and for a^ ;a pp r - 
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LEMMA 4.2. Let ip k : (0, 1) + — > M be nonconstant and square-integrable. 

Then (p k is a s core- generating function for a^ ;cx - If, moreover, ip k is a linear 
combination of a finite number of square-integrable functions that are mono- 
tone in all their arguments, then cpk is also a score- generating function for 

(n) 
°Vfc;appr- 

Proof. The proof easily follows along the same lines as in the nonserial 
case and is left to the reader. □ 



4.3. Unconditional asymptotic representation. Lemma 4.1 was only an 
intermediate, conditional result; the following proposition provides the cor- 
responding unconditional asymptotic representation and asymptotic nor- 
mality. Define 



AVfc := / ipk(u ,---,u k )du ---du k , 

J[0,l] fe+1 

$1 := / fk(uo, ...,u k )du --- du k , 

J[0,l/2] k + 1 

(fe+i) . = / p k (u , . . . , u k ) du ■ ■ ■ du k 



'[l/2,l] fc+1 



and, for u = 1, 2, . . . , k, 



0<«l<---<j,y<fe 



(u H ,...,u^ )£[1/2A] , J(u j ,0<j<k,jjti u ...,i^e[0,l/2] k + 1 -" 

(p k (u ,...,u k )du --- du k . 

Proposition 4.1. Let p k be a nonconstant square-integrable score-gene- 
rating function for <S^. eX/ / appr • Whenever approximate scores are considered, 
assume that (p k is a linear combination of square-integrable functions that 
are monotone in all their arguments. Then, writing si for either S<p";ex or 
q( n ) 

*-Vfc;appr • 

(i) (Asymptotic representation) under H^J, as n — >oo, 
SJ B) -E[SW] 

+ 2 k+1 [n(n- l)-.-(n- k)}' 1 
x \l[N M >k + 1]N^\n^ -!)••• (N^ ] - 
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+ Y J I[k + l-v< N { l n) <n-v] 

x N H (N H N^-k + u) 

x iv| n) (jvf } - 1) • • • (ivj n) - 1/ + 1)^ 

+ I[ivj n) > fc + i]ivj n) (ivj n) - 1) • • • (iv| n) - k)$ h +1) | 
+o P (i/v / «); 

(ii) (Asymptotic normality) i/, moreover, cp k is (2 + 5)-integrable for some 
5 > 0, then, under TL^, as oo, 



(4.6) 



Vn—k(S^-E[S^})/ 



■Af(0,l), 



\ 



V 2 + {k + l) 2 



fc+i 



with V 2 given in (4.4). 



When the score ip k is skew-symmetric with respect to 1/2 [i.e., (p k (uo, . . . , 
Ui,...,u k ) = -cp k (u , . . . ,1 - Ui,...,u k ) for all i = l,...,k], then n Vk = 

E^UH = 0with 



k + l 



[l/2,l]"x[0,l/2] fe + 1 -' y 



(p k (u ,...,u k )du --- du k , 



so that 



fc + 1 

1 



This and the fact that N_ jn — i = Op(n x / 2 ) implies that the right-hand 



2 



u {2) 



jfe + 1 
3 



side of (4.5) reduces to T^ ;f . k - E[r^ /;fe |Zj^] +o P (l). Hence, the condi 



(n) liyCn)-! 



tional (4.3) and unconditional (4.5) representations of — ~E[S^ l> ] coin- 
cide. 



Proof of Proposition 4.1. As in the nonserial case, we first prove 
the asymptotic representation result for exact scores. From the definition 
of exact scores, we obtain, for S% = S^-cx, writing for T^.^. k : = 



^EUiw(^ B) ) TO. 



E [ 5 W| N W] 
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= E[E[r^ n) |R( n) ,N( n )]|N^] = E[7\ (n) |N( n )] 
= E 



-i- J2 nMF(zi n) ),...,F(ztl)W n \si n \...,si%]\N^ 



t=k+l 



where 

E[MF(zi%...,F(Z^l))\N^\sl n \...,st{] 



= 2 fc+i / (p k (u ,...,u k ) 

J[o,i] fe+1 

(4.7) 

x 7[sign(u - \) = sf\ 

sign(u fc - |) = s^ fc ] du • • • <^fc- 

The asymptotic representation (4.5) (for exact scores) follows by combin- 
ing (4.7) and part (i) of Lemma 4.1. Turning to approximate scores, it is 
sufficient for (4.5) to hold that 

(4.8) : = {E[5g appr |N<»>] - E[5^ appr ]} - {E[S^ CX \N^] - E[S$. ex ]} 
be op(l/y / n). Note that 
E[Sffi appr |NW] 

= [n{n - I) ■ ■ ■ {n - k)}~ 1 

x E-E 

l<ti^— ^i fc+ i<n 

^ r < ^ } ] r -^s— i + /fc > Aa r \ + 



2(iv! n) + l)/ V2 2 (iv| n) + l) 

I[i k+1 <N^]( ^ ) 
1 l \2(N {n) + l)J 



rr- ^ wWi/ 1 , *fc+i -JV- 
V2 2(Af| j + l) 

For notational simplicity, let us consider the case k = 1; the general case 
follows along the same ideas. For fc = 1 we have 



E[^ appr |N(")]-E[5(™) ex |NW] 



n(n " 1} li<^W V 2(AT(") + 1) ' 2(tfW + 1) 
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^ (n) n (n) 

i j- Nr } 



+ E E M n ,„M . ^ >o + 



i=1 j=Ar(™)+l 



2(ArW + i) ' 2 2 (^| n) + 1) 



(«) 

1 i - 



/l i-ivi n) 1 j-ivi n) 



Ari n) +l<j^j<n 



2(iv| n) + 1) ' 2 2(iV|" ) + 1) 



//rjvW > 2]ivi n) (ivi n) - 1)^(0) 
n(n — 1) ^ 

+ J[l < M n) < n - l]Ari n) ^V (1) 



+ /[ivj n) > 2]ivj n) (ivj n) - 1)$}} 
4ivi n) (ivi n) -l)+f (iV W ) 2 

^jvrW Art") ~ ^ 



n(n - 1) | Ar^)(ivi n) - 1) Ar - > N 



Ari n )(ivi n) -l)4(ivi re) ) 5 



JV (n) 



E pi 



i ' "V2(Af W + l)'2(ivi n) + l) 
+ n(n-l) {i V\7v("> *V } + n(n-l) { °N^,N^ ~ *V } 



n(n - 1) } jy-W ( iv| n) - 1) N + > N + 



Vl 



(iv| n) ) 2 1 
ivf } (iv| n) -l)4(iv| n) )2 

A /l i 1 

x E yi k + n/ , rM , + 



8=1 



2 2(Af| n) + 1) ' 2 2(7\d n) + 1) 
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where x + := max(0,a;), 

i £ m r • 



D e,m : ^^^^(f+l)' 2(m + l)J' 
1 ^ m ( % 1 ' 



Mmfri^ \2(£+l)'2 2(m + l) 
1 < ™ /i i j 



i=ij=i 

£ m 



4^m^^ 1 V2 + 2(^ + l)'2(m + l) 



A" 



1 /l i 1 



m ' 4^m^^ r V2 2(^ + 1) '2 2(m + l) 



are Riemann sums for the integrals 

,1/2 ,1/2 
,1/2 ,1 

Vvi := J J x 2 <Pi{uo,ui)duodux, 

,1 ,1/2 

M^i := J 1/2 J t Pi{uo,u 1 )du dux, 
//++:= / / ^i(u ,ui)dw d«i, 

^ J1/2J1/2 

respectively. Here again, due to the fact that 9^1 is square-integrable, the 
function (u, t>) 1— ► y?*(u, := c^i(it, v)/[u = v], (u,t>) G [1/2, l] 2 , which van- 
ishes except over the diagonal of the unit square, is integrable and has in- 
tegral zero. Hence, (l/4m 2 ) YT=i ¥>i(§ + 2 ( m +i) ' 3 + 2(m+i) )> as a Niemann 
sum for the integral of (p\ over [1/2, l] 2 , is o(l). Since 



1=1 



1 

+ -TTT,» + 



2 2(m + l)'2 2(m + l) 



i=l 



2 2(m + l)'2 2(m + l) 



it follows that (l/4m 2 )E^i^i(| + 2(^+Ty^+ 2(^17) is o(l/Vm),asm-» 
00. A similar result holds for (l/4m 2 ) X)E=i yi ( 2{m+i) ' 2(m+i) )' as we ^ as ' °^ 
course, for any individual terms such as (l/4m 2 )</?i(^ + 2 {m+i) ' 2 1 + 2(m+i) )• 



SIGN-AND-RANK STATISTICS 



27 



Thus, (4.9) as n — > oo takes the form 

E[^ appr |NW]-E[5(") cx |NW] 

4iyW(iyW_l)+ r 

n(n-l) [D n<?\nW 

+ n(n-l) 1 Jvi'V™ *V J 

+ n(n-l) 1 Jv£Vi"> " ^ J 

4iv| n) (iv| n) -l)+ r , , ++1 
+ + re ( n -l) [D %W ~ ^ ] + ° p(1/ ^ } - 

Considering the difference — iz j£+ , we have 

1 m m f 1 i 1 



(4.10) 



4m 2 f-ff-J ri \2 2(m + l)'2 2(m + 1) 



</?i(ito, «i) duo <iui 

1/2,1] 2 

E r /l Z 1 J 

2- WU + 9^X1V9 + 



4m 2 frf f-f r V2 2(m + l)'2 2(m + l) 
y?i(ito,Mi) du dui + o(l/ v / m), 

[1/2.1P 

because, in view of the same argument as above, the two first sums in (4.10) 
are o(l/y/m). As in the proof of Proposition 3.1, due to the fact that ipi can 
be assumed to be nondecreasing in its two arguments, the sum that appears 
in this latter expression is composed between the two Darboux sums 

1 fl i-1 1 j-l 



and 



4m 2 f-f f-{ r V2 2m '2 2m 

i=l J= i 



m ' m: 4m 2 ^ ^ ^ 1 U + 2m'2 + 2mJ' 
These Darboux sums also converge to the integral JJ^/ 2 i]2 ¥>i(^o, n i) du\ 
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D2 



D ++ 



4m 2 



1 m — 11 m — 1 



^ ll 2 + 2m '2 



r + 



2m 



1 1 

^ ll 2'2 



the same argument still implies that this difference, hence also — //^, 

is o(l/y / m). The other three quantities of the same type can be treated 
similarly. Uniform integrability and the fact that N±^ are Op(n), as in the 
proof of Proposition 3.1, complete the proof that (4.8) is indeed op(l/y / n). 

To conclude, we now prove the asymptotic normality result. Denote by 
Ufc + i the set of permutations tt of {1, . . . , k + 1}. Then 



n 
k + l 



'k+l 



E-E E 

l<ti< - <t k+1 <n Ki/=0 



1 



(fc + l)! 



nfc+l >) 



x E /[-C^i,..,^^!, 

7ren fe+ i 



» 



'^■■■> s Llk+D - 1 U ; 



hence E[5^ ;ex |NW] is a [/-statistic computed from the n-tuple Z^ \ . . . , Zn ^ 
with kernel 

h k (zi, Zk+i) 



k+l k+l,.(v) 
\ -> f Fyfc 



X E > °> • • > «»r(«/) > °> *>r(H-l) < °> • • > ^(fe+l) < 0] • 

fen H1 

(n) 

Routine calculation yields, under 7~C .f, 
E[h k ( Z [ n \...,zi%)\Z^] 

k+l k+l , 1 

= ^ > o] E + ui4 n) < o] E 

and 

{fe+i ~) 
^-^EfcTI*} 
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which is strictly positive. Classical results on [/-statistics (see, e.g., [19]) 
then imply that, under Tt^J, as n — > oo, 

(n-fc)V2( E [4^ ex |NW]-E[5W ex ]) 

The same argument as in the nonserial case can be invoked to establish the 
asymptotic independence of the right-hand side in the conditional asymp- 
totic representation (4.3) and (n - fc) 1 / 2 (E[5£ ) ;e x|N(")] - E[Sj$eJ). The re- 
sult follows. □ 

4.4. Example: first-order median moving average. The central sequence 
(2.5) under clearly [central sequences are always defined up to op(l) 
quantities] can be rewritten as 

Aj t) = v^Trg + 0p (l), 



where, defining <pf(u) := -jr (-F 1 (u)) and ipf(u) := F 1 (u), u£ (0, 1), 



» ._ 



1 n 



^ 71 - t=2 

a measure of serial dependence associated with /. With this notation, it 
clearly appears that is a particular case [letting k = 1 and ip\(uo,ui) := 

ipf(uo)ipf(ui)] of the statistic T^}f. k considered in Lemma 4.1. 

Define the serial linear sign-and-rank autocorrelation statistic of order 
1 (based on exact scores) as Zj^i*cx := E[rjvj'|N( n ), R,( n )]. Proposition 4.1 

implies that, under Pj^j, as n — ► oo, 



with 

(n)* _ (n) Fr r ( n )|7( n )i 

r /;l ~ r /;l ~ tj i r /;ll Z (.) J 



+ 



2 2 



n(n — 1) 



I[N^ > 2]N {n) (N {n) - l)(-/(0)) / xf(x) dx 
+ /[l<ivl n) <n-l}N { ^N^ 
x (-f(0)J™xf(x)dx + f(0) £^xf{x)dx S j 
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+ /[iV| n) >2]/(0) r X f{x)dx 
Jo 

+ op(l/y/n) 

N tn) _ N (n) 



r 



g - E[rg |ZW] + 2/(0)/, /'+ n " + op(1/v^) 

-—ik~ [ t ' , - 1 "^) 1 <t,k~ " b 

+ 2/(0)/i/^ — + op(Vv^) 

re 

= -^tE -r(^ B) )(^ffl - a 1 /) + 2/(0 w + + qp(i/v?i). 

re i t=2 j re 

Letting 

(4-11) :~E^(#W/(#i) 

" 1 i=2 

with given in (3.22), the approximate score counterpart of r j""]* ex is, in 
view of Lemmas 4.1 and 4.2, 

r?£ ppr =rg; aPPr - Efcg ;appr |NW] + 2/(0)^ jV+ 

1 n 

1 ^ '->(") N„/, /n(«) 



(4-i2) =^ I E^/(^r)^(5n; 



t=2 

1 ^ ^ N (n) - 7V (n) 

-^ry EE ^/(gir ) )^(g& ) )+2/(ow iV+ ■ 

In conclusion, defining Ay * := \/n — lr^* and Aj n ]*/ appr := \Jn — \ x 
r( n )* , we obtain 

/;l;ex/appr 

Af" = Ag: + op(l) = At + «p(l) 

under Wq™^, as n — > oo. Using standard arguments, one easily verifies that 
A, . Af"i* and A^l* are indeed three versions of the semiparametrically 
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efficient central sequence for 9 in the model £g '; again, the sign-and-rank- 
based Aj^ppr can ^ e use d to perform semiparametrically efficient inference 

(tests, estimation, etc.) for the MA(1) coefficient 9; see, for example, Sec- 
tion 11.9 of [16]. 

5. Numerical study. The finite-sample performance of the proposed test 
statistics has been studied in the context of the first-order moving average 
model of the example in Section 4.4. More precisely, we generated 1000 
replications of each of the MA(1) processes characterized by equation (2.4) 
with parameter values 9 = ±0.3, ±0.25, ±0.20, ±0.15, ±0.10, ±0.05 and 0, 
and the following asymmetric innovation densities: 

(a) f(z) := ftj[z < 0] + ftf(p t i)(z)I[z > 0], where f tl stands for the Cauchy 
density and /jv"(o,i) f° r the standard normal one; 

(b) f(z) := ft 5 I[z < 0] + fj\f(o,i)( z )I[ z > 0]> where ft 5 stands for the Student 
density with 5 degrees of freedom; 

(c) / := /at a= _i (the skew normal density with skewness A = —10; see [1]), 
duly shifted and rescaled to have zero median and unit variance; 

(d) / := /a/" a= _2o (t ne skew normal density with skewness A = —20), duly 
shifted and rescaled to have zero median and unit variance; 

(e) / := 0.5/yv"(o,i) + 0-5/aT(-5,2) ( a mixed-normal density), duly shifted and 
rescaled to have zero median and unit variance; 

(f) / := 0.75/^(0,1) ± 0-25/_a/(-5,i) (a mixed-normal density), duly shifted 
and rescaled to have zero median and unit variance. 

For each replication, randomness (namely, 9 = 0) has been tested against 
first-order moving average dependence (two-sided test), based on the asymp- 
totically normal distribution of: 

(i) the ordinary first-order autocorrelation coefficient 

n n 

r W : = („ - l)" 1 £(Z t - zW)(Vi - Z^) /n' 1 J2(Zt ~ Z^f; 

t=2 t=l 

(ii) the "traditional" first-order van der Waerden rank autocorrelation co- 
efficient 

( n / f?( n ) \ / ??( n ) 



t=2 



n + 1 / \n + 1 



[n(n-i)]- 1 yy —— $ 

U+i/ Vn+1 



1 U-if 3 U/^) 



vdW;l' 



where $ stands for the standard normal distribution function and 
^vdw-i s t an ds for the exact standardizing constant (see, e.g., [8]); 
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(iii) the "traditional" first-order Wilcoxon rank autocorrelation coefficient 

- Nn - I)]- £X>o E (^>o g (^) }/«& 

with if\ og (u) :=2u—l and ip\ og (u) := ln(j-r^), u G (0, 1) (^log is propor- 
tional to the inverse of the logistic distribution function) ; ct^i stands 
for the exact standardizing constant (see, e.g., [8]); 

(iv) the "traditional" first-order Laplace rank autocorrelation coefficient 

$-{c-i>-'t^(^M^) 

-H»-r'EE^U)*.,(^))/{l 

l<«7=7<ri ) 

with <^exp(u) := sign(2u — 1) and 

VwO) := ln(2it)I[u < 0.5] - ln2(l - u)I[u > 0.5], u G (0, 1) 

(■i/'exp is proportional to the inverse of the double-exponential distribu- 
tion function); stands for the exact standardizing constant (see, 
e.g., [8]); 

(v) the first-order sign-and-rank autocorrelation coefficient Tw/VdW-i 

fined in (4.12), with the approximate scores tpf(u) = ~ < Plog( u )I[u < 
0.5] + (j)- l {u)I[u > 0.5] and ^ f (u) = ^i og (u)I[u < 0.5] + (j)- l {u)I[u > 
0.5] associated with a density /(z) := ± I[z < 0] + fj^ {0jl) (z)I[z > 

0] (with 7 := that is logistic on the negative half-line and stan- 

dard normal on the positive half-line (yielding Wilcoxon scores for the 
negative residuals and van der Waerden scores for the positive ones); 

(vi) the first-order sign-and-rank autocorrelation coefficient ZL/vdW-1 

fined in (4.12), with the approximate scores 

ip f (u) = —I[u < 0.5] + (t)- l (u)I[u > 0.5] 

7 

and 

iP f (u) = ji/j exp (u)I[u < 0.5] + -1 (u)I[u > 0.5] 



SIGN-AND-RANK STATISTICS 



33 



1 

0.9 


- 






A 




















0.8 
0.7 
0.6 


























0.5 














0.4 








LShL. 






0.3 
































0.2 
0.1 















) autocorr 














-»- ! 

+- ( 
-*- < 


i) vdW 

ii) W 
v) L 












-e- (v) W/vdW 
-x-- (vi) L/vdW 





-0.4 -0.3 -0.2 -0.1 0.1 0.2 0.3 

theta (the parameter of the MA(1) model) 



Fig. 1. Empirical power, under Cauchy /standard normal innovations (a), of various 
parametric, rank and sign-and-rank tests for randomness against first-order MA depen- 
dence [based on the test statistics (i)-(vi)]. The series length is n = 250; 1000 replications 
were performed. 

associated with a density f(z) := ^ exp(z/j)I[z < 0] + fj\f(o,i)( z )I[ z > 
0] (with 7 = y/Tr/2) that is double-exponential on the negative half-line, 
and standard normal on the positive half-line (yielding Laplace scores 
for the negative residuals and van der Waerden scores for the positive 
ones). 

The results of these simulations (series length n = 250; number of replica- 
tions 1000) are summarized in Figures 1-6, where the graphs of the empir- 
ical power functions associated with testing procedures (i)-(vi) are plotted 
against 9. 

These graphs speak for themselves and need little comment. They all 
clearly demonstrate the superiority, under asymmetric densities, of the sign- 
and-rank methods over both their classical Gaussian and traditional rank- 
based competitors. The more skewed the underlying density, the more sig- 
nificant the improvement. For instance, in Figure 1 [Cauchy /Normal den- 
sity (a)] the percentage of rejection at 9 = —0.05, which is only 0.0240 for 
the traditional correlogram-based tests (a severely biased test, thus), is as 
high as 0.7720 for the sign-and-rank Laplace/van der Waerden tests (vi). At 
9 = —0.10, the corresponding figures are 0.2460 for the correlogram-based 
tests, but 0.9770 for the Laplace/van der Waerden ones. Of course, the 
performance of the parametric correlogram method in this case is partic- 
ularly poor, due to the absence of finite moments, but the superiority of 
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Fig. 2. Empirical power, under Student (5 d.f.) /standard normal innovations (b), of 
various parametric, rank and sign-and-rank tests for randomness against first-order MA 
dependence [based on the test statistics (i)-(vi)]. The series length is n = 250; 1000 repli- 
cations were performed. 
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Fig. 3. Empirical power, under skew-normal (\ = innovations (c), of various para- 

metric, rank and sign-and-rank tests for randomness against first-order MA dependence 
[based on the test statistics (i)-(vi)]. The series length is n = 250; 1000 replications were 
performed. 
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Fig. 4. Empirical power, under under skew-normal (X = —20) innovations (d), of vari- 
ous parametric, rank and sign-and-rank tests for randomness against first-order MA depen- 
dence [based on the test statistics (i)-(vi)]. The series length is n = 250; 1000 replications 
were performed. 
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theta (the parameter of the MA(1) model) 



Fig. 5. Empirical power, under mixed normal innovations (e), of various parametric, 
rank and sign-and-rank tests for randomness against first-order MA dependence [based on 
the test statistics (i)-(vi)]. The series length is n — 250; 1000 replications were performed. 



the sign-and-rank-based methods over their "purely rank-based" competi- 
tors remains quite substantial (at = —0.05 and 9 = —0.10, Wilcoxon tests 
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Fig. 6. Empirical power, under mixed normal innovations (f), of various parametric, 
rank and sign-and-rank tests for randomness against first-order MA dependence [based on 
the test statistics (i)-(vi)]. The series length is n = 250; 1000 replications were performed. 

only yield empirical powers 0.4360 and 0.8250). Quite understandably, this 
superiority of the sign-and-rank methods over their competitors fades away 
under moderately skewed densities (see Figure 2, where it is less pronounced 
than in Figure 1), but it remains extremely substantial in Figures 4-6. 
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